Linguistic Error Correction Of Japanese Sentences

نویسندگان

  • Tsutomu Kawada
  • Shin'ya Amano
  • Kunio Sakai
چکیده

This paper describes a newly developed linguistic error correction system, which can correct errors and rejections of Japanese sentences by using linguistic knowledge. Conventional optical character readers (OCR) need human assistance to correct their recognition errors and rejections. An operator must teach the OCR correct answers whenever an illegible character pattern occurs. If this error correction operation is mechanized, the throughput of the OCR will increase. This linguistic error correction system offers means of automated error correction by analysing sentences of the OCR outputs linguistically. This system grammatically selects legal letters from the candidates which can not be decided uniquely by pattern recognition only, and recommends grammatically and semantically meaningful letters for the illegible letter.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Generating artificial errors for grammatical error correction

This paper explores the generation of artificial errors for correcting grammatical mistakes made by learners of English as a second language. Artificial errors are injected into a set of error-free sentences in a probabilistic manner using statistics from a corpus. Unlike previous approaches, we use linguistic information to derive error generation probabilities and build corpora to correct sev...

متن کامل

Grammar Error Correction Using Pseudo-Error Sentences and Domain Adaptation

This paper presents grammar error correction for Japanese particles that uses discriminative sequence conversion, which corrects erroneous particles by substitution, insertion, and deletion. The error correction task is hindered by the difficulty of collecting large error corpora. We tackle this problem by using pseudoerror sentences generated automatically. Furthermore, we apply domain adaptat...

متن کامل

An Efficient OCR Error Correction Method for Japanese Text Recognition

OCR error correction using Japanese morphological analysis contains two time-consuming procedures: extraction of candidate words from combinations of candidate characters, and finding the most plausible word sequence in combinations of the candidate words. In this paper an optimal word extraction technique, and the use of lexical entries that are tailored for Japanese verb inflection, are inves...

متن کامل

Two modes of assessment: the case of academicians' writing

This study attempted to investigate writing problems and the relationship between expert-assessment and self-assessment of writing problems. Participants were thirty four non-English faculty members of Tehran and Guilan universities. The instruments were writing an essay on the topic "What teaching strategies do you use in your classes?" in twenty five lines and filling the questionnaire of wri...

متن کامل

Analysis of voice fundamental frequency contours for declarative sentences of Japanese

A model for the generation of fundamental frequency contours (F0contours) of spoken, sentences is presented for the purpose of elucidating the relationship between the sentence F0 contour and the linguistic and non-linguistic information. It is based on a quantitative formulation of the process whereby the logarithmic fundamental frequency is controlled in proportion to the sum of two component...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1980